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introduction  and  background 


Dealing  with  any  task  involving  physical  objects  in  the  three-dimensional  world 
requires  a  suitable  representation.  Computer  scientists  have  used  and  proposed  a  variety 
of  these  representations  since  the  first  attempt  to  computerize  such  an  undertaking. 
The  earliest  models,  deployed  by  pioneers  in  the  field,  maintained  an  object  as  an 
ordered  set  of  endpoints  (i  e  corners)  or  pairs  of  points  (i.e  lines  or  edges).  Though 
these  had  the  attractive  features  of  making  the  object  representation  explicit  in  a 
geometric  sense  and  of  facilitating  straightforward  display  of  the  particular  structural 
instance  and  spatial  orientation,  their  downfall  resulted  from  the  fact  that  data 
manipulations  of  most  any  sort  (and  thus  transformations  and  manipulations,  which 
effect  spatiotcmporal  changes,  and  prototype  similarity  recognition  of  unknown  objects) 
were  indeed  computationally  cumbersome 

As  carls  as  I  % \  Roberts  [  1 8 j  was  attacking  the  problem  of  machine  perception 
of  three-dimensional  scenes  involving  polyhedra.  The  broad  goals  of  this  pioneering 
work  included  the  construction  of  line  drawings  from  a  pictorial  source,  the  recognition 
of  the  three-dimensional  objects  present  in  such  drawings,  and  the  redisplay  of  these 
forms  after  performing  hidden  line  removal.  In  the  computer,  polyhedra  were 
represented  as  sets  of  line  segments,  each  corresponding  to  a  physical  edge;  the  desired 
manipulations  were  ihcn  effected  by  means  of  the  now  classic  matrix  transformations 
applied  to  vectors  in  a  homogeneous  coordinate  system  A  thorough  treatment  of  these 
techniques  is  given  by  Newman  and  Sproull  [14].  The  computational  demands  that  this 


method  imposes  even  for  simple  transformations  (eg  translation,  rotation,  and  scaling) 
in  addition  to  the  cumbersome  nature  of  attempts  at  object  recognition  have  encouraged 
researchers  to  pursue  representations  which  are  more  conducive  to  the  domain  of  use. 

By  1971,  Binford  (3)  had  developed  an  innovative  alternative  representation  for 
curved  objects.  His  scheme  was  to  model  objects  in  terms  of  "generalized  cylinders",  i.e. 
all  physical  objects  in  a  system  would  lie  transformed  in  some  way  to  a  conglomeration 
of  cylinders  of  varying  size,  shape,  and  orientation.  He  thereby  introduced  the  notion  of 
representing  all  solid  bodies  as  assimilations  of  objects  taken  from  a  small  set  of 
primitive  solids  In  Binford’s  case,  a  cylinder,  the  single  primitive,  consisted  of  a  space 
curve,  or  axis,  and  a  circular  cross  section  function  on  this  axis.  A  simple  object  could 
be  modeled  b\  choosing  a  single  space  curve  and  an  associated  cross  section  function  so 
as  to  yield  a  good  fit  with  respect  to  the  shape  and  dimensions  of  the  surface,  and 
complex  objects  could  then  be  a  composite  of  generalized  cylinders.  Of  course,  a 
sacrifice  was  made  in  exchange  for  this  simplified  model  --  exact  representation  became 
impossible,  and  it  was  highly  probable  that  the  mappings  of  objects  into  the 
representation  were  not  unique,  making  recognition  difficult 

Shortly  thereafter,  two  independent,  assembly-oriented  systems  evolved.  That  of 
Braid  [4]  was  based  on  a  collection  of  six  primitive  solids  --  cuboids,  wedges,  tetrahedra, 
cylinders,  sectors,  and  fillets  Voelcker  [19]  based  his  representation  system  on  cuboid 
and  cylinder  primitives  The  idea  of  using  generalized  cylinders  or  cones  to  approximate 
parts  of  objects  after  segmenting  them  into  appropriate  pieces  has  been  further 
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developed  by  Marr  and  Nishihara  [12]  A  discussion  of  various  other  approaches  to 
three-dimensional  geometric  modeling  m  a  visual  system  (such  as  by  statistical  world 
models,  procedural  knowledge,  or  semantic  knowledge)  is  presented  by  Baumgart  |2). 

Each  of  these  representations  has  several  drawbacks,  the  major  ones  are 
difficulty  of  computational  manipulation,  difficulty  of  object  recognition,  and  an 
incompatibility  between  the  representational  form  of  the  object  and  the  object 
description  available  as  "input"  In  this  light,  a  representation  which  facilitates  ease  of 
manipulation  and  recognition  and  which  is  a  "natural"  form  for  the  object  seems  to  be  a 
most  desirable  attainment  It  is  these  qualities  which  the  enhanced  spherical  image  (ESI) 
representation  m  mitains 

In  simplest  terms,  an  ESI  model  for  a  convex  object  consists  of  a  set  of  rectors. 
An  individual  sector’s  direction  component  represents  the  direction  of  a  surface  normal 
of  the  object,  and  the  vector's  magnitude  signifies  the  object’s  surface  area 
corresponding  to  the  particular  normal  direction.  Alternatively,  considering  each  vector 
of  the  set  to  he  of  unit  length  and  locating  them  all  at  a  common  point  of  application,* 
the  locus  of  points  consisting  of  their  endpoints  lies  on  the  surface  of  a  unit  sphere. 
Such  a  mapping  was  developed  by  Gauss  and  is  known  as  a  spherical  image.  (6)  A 
scalar  value,  which  represents  the  surface  area  corresponding  to  the  normal  vector,  may 
then  lie  tagged  to  each  point.  This  model  of  an  object  has  been  dubbed  an  enhanced 

1  from  differential  geometry.  »  vector  in  l-ipoce  conmti  of  too  point!  In  J-lpoce  it!  voctor 
port  v  end  iti  point  of  opplicetion  P  The  vector  oiy  then  be  pictured  01  the  errow  trm  the 
point  P  to  the  point  V»P  [IS] 


spherical  image. 
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Representational  advantages 

With  the  success  of  several  computerized  techniques  for  determining  object 
shapes  from  photographic  sources  -•  such  as  photometric  stereo  [21),  which  uses  two  or 
more  images  of  the  same  scene  taken  with  varying  directions  of  incident  illumination, 
and  so-called  "shape  from  shading"  algorithms  [7,  8]  which  determine  orientations  by 
augmenting  gradient  space  schemes  with  image  intensity  information  -  digitized 
representational  data  may  be  produced.  To  organize  this  information  into  an  easily 
manipulated  and  thus  useful  form,  it  needs  to  be  encoded  into  a  complementary  object 
representation  I  he  shape  determination  process  actually  resolves  the  object  surface 
normal  at  each  point  in  the  scene  With  this  knowledge,  the  surface  area  of  the  various 
regions  which  compose  an  object  may  be  easily  computed  (Note  that  the  normal 
vectors’  directions  determined  for  patches  of  a  single  planar  region  are  identical  to 
within  a  reasonable  degree  of  certainty.)  A  single  vector  may  then  be  used  to  represent 
the  direction  in  which  a  flat  surface  is  facing.  Pairing  this  vector  with  the  region's  area 
and  doing  this  for  all  of  an  object's  regions  (i.c.  faces)  will  yield  an  ESI  representation. 

The  question  that  immediately  arises  concerns  the  case  in  which  some  faces  of 
an  object  are  not  visible  to  the  shape  determining  process.  Certainly  the  observable 
regions  may  be  accurately  represented  Under  such  circumstances,  a  useful  property  of 
ESI's  may  then  be  employed,  specifically  that  referred  to  as  the  center  of  mass  property. 
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(Sec  pace  28  in  Appendix  II.)  With  this,  an  object’s  description  may  be  checked  for 
completeness  ("wholeness");  and  if  a  region  is  missing,  the  appropriate  direction  and  area 
needed  to  fill  in  the  hole  may  be  calculated  and  appended  to  the  object’s 
representation  * 

Once  an  object  has  been  represented  in  this  fashion,  the  commonplace  object 
manipulations  become  trivial  to  execute.  It  is  assumed  here  that  the  representation  is 
maintained  as  a  set  of  vectors,  all  of  which  have  a  common  point  of  application  (a  local 
origin  in  Cartesian  space)  and  whose  vector  parts  are  specified  in  spherical  coordinates 
relative  to  this  local  origin  Translations  do  not  effect  ESI  representations.  Rotations 
about  the  point  of  application  are  effected  by  adding  the  appropriate  directional  offset 
to  the  two  components  of  each  vector’s  direction  specification  or,  equivalently, 
maintaining  a  base  vector  which  would  be  added  to  each  vector  on  any  reference  to  any 
vector’s  direction  de  rotating  the  sphere)  *  Rotations  about  an  arbitrary  axis  in  the 
Cartesian  sp.ice  may  then  be  effected  by  a  rotation  about  the  local  origin  followed  by 
the  appropriate  translation  To  do  scaling,  the  area  associated  with  each  ESI  point 
would  !>e  multiplied  by  the  square  of  the  desired  scale  factor  ••  which  could  also  be 
maintained  in  a  base  vector.  These  methods  arc  significantly  less  computationally 
demanding  than  would  be  the  case  for  a  point-edge  representation  of  an  object. 

1  With  r«tp*cl  to  r*ro«»truct  ion.  howevrr.  it  U  mavitabt*  that  on  object's  facat  which  ira 
adjacmt  to  l ho  contrived  face  will  bo  io»rvhat  d*form*d  10  |hap*  if  »or«  th»n  «  llngta  planar 
lurfaco  wot  nutting  from  the  dricription 

2  It  "*o y.  hootrtr.  bt  timplar  to  r*prrt*nt  th*  vector  parti  of  tb»  (SI  point!  In  Cartotlon 
coordinate!  to  that  a  J  »  J  orthonoriaal  melon  may  bt  ul«4  to  aiamtam  tha  rotation  information 


Reconstruction 


An  ESI  representation  of  an  object  is  not  in  itself  "viewable"  or  "ciisplayable". 
In  fact,  it  is  not  clear  that  there  is  a  straightforward  method  for  obtaining  a  point-edge 
representation  of  an  object  from  its  ESI.  What  is  needed  is  a  method  to  locate  the 
planes  which  compose  the  object  faces  at  the  proper  distance  from  a  center  point.  The 
vector  direction  specifies  the  direction  for  the  normal  vector  of  the  plane,  while  the 
intersection  of  adjacent  planes  must  define  a  region  of  the  proper  area.  Techniques 
have  ahead)  been  developed  for  generating  an  object  from  a  set  of  intersecting  planes. 
(22) 

A  recursive  algorithm  for  reconstruction  has  been  suggested  but  not 
implemented  (See  page  35  in  Appendix  II.)  The  proposed  proof  of  correctness  has 
been  shown  to  have  .in  error:  a  pair  of  ESI  points  may  be  arbitrarily  close  and  yet  not 
represent  adjacent  faces  of  a  polyhedron.  One  should  note,  however,  that  this  does  not 
prose  the  algorithm  itself  to  l>e  incorrect. 

An  iterative,  reconstructive  algorithm  has  also  been  envisioned  though  there 
remains  considerable  uncertainty  about  its  convergence.  One  may  conceive  of  sliding 
planes  in  and  out  from  a  common  point  of  application  along  the  ray  coinciding  with 
their  normal  vectors,  trying  to  adjust  each  plane's  position  so  as  to  produce  polyhedral 
faces  with  the  required  areas.  Specific  techniques  have  not  been  investigated. 


One  advantage  of  the  ESI  representation  is  a  simple  method  of  hidden  line 
elimination  Once  an  object  has  been  reconstructed,  its  surfaces  and  edges  have  been 
determined.  Then  only  those  faces  which  correspond  to  ESI  points  that  lie  in  the 
hemisphere  of  the  unit  sphere  which  is  observable  by  the  viewer  would  be  made 
manifest  in  the  actual  display.  Conveniently,  these  faces  are  exactly  those  that  the 
viewer  would  see  if  he  were  examining  the  three-dimensional  object,  while  the  points 
from  the  opposite  hemisphere  are  the  portions  of  the  object  which  the  viewer  would  be 
unable  to  see  from  his  current  observation  position 

Recognition 

Another  intended  advantage  of  modeling  objects  by  their  spherical  images  is 
that  one  can  easily  recognize  shapes  by  comparing  their  representations  For  polyhedra, 
the  task  may  be  pictured  as  that  of  trying  to  rotate  two  concentric  spheres  of  unit 
radius  such  that  the  dots  on  their  surfaces  match  up  (i.e  for  each  dot  on  one  sphere 
there  is  a  dot  on  the  other  sphere  in  the  same  position).  Since  the  dots  may  be  of 
various  "intensities"  or  "masses"  (recall  that  the  area  is  encoded  in  some  fashion),  these 
values  also  need  to  agree  for  there  to  lie  a  match. 

The  problem  involves  three  degrees  of  freedom  (two  for  the  axis  direction  plus 
one  for  the  rotation  position  about  the  axis)  and  is  equivalent  to  holding  one  sphere  in  a 
fixed  orientation  while  matches  are  attempted  by  rotating  the  other  sphere  about  many 
different  axes.  One  may  envision  various  clever  schemes  for  implementing  this 
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trial-aml-error  approach  to  the  matching  process,  such  as  only  trying  those  axes  which 
pass  through  ESI  points,  rotating  the  sphere  to  match  one  pair  of  points,  and  then 
comparing  the  other  points  To  determine,  however,  that  two  representations  of  N 
points  each  are  not  the  same  type  of  object,  such  a  scheme  would  try  on  the  order  of 
N"  different  positions  of  the  sphere  and  would  require  comparing  N  points  for  each. 

If,  on  the  other  hand,  a  canonical  orientation  could  be  found  for  each  sphere, 
then,  b\  first  aligning  them  according  to  their  characteristic  orientations,  only  one 
relative  position  of  the  spheres  need  l>e  tested  and  only  N  pairs  of  points  need  be 
checked  to  determine  a  match  To  develop  such  a  promising  technique,  it  is  convenient 
to  regard  the  ESI  points  as  point  masses,  the  representation  of  a  polyhedron  thus 
becoming  a  system  of  point  masses.  Obviously,  the  problem’s  three  degrees  of  freedom 
are  eliminated  if  two  axial  directions  on  a  sphere  arc  fixed.  Moments  of  inertia  for 
physical  bodies  are  determined  with  respect  to  a  given  axis,  so  one  naturally  thinks  of 
the  axes  of  minimum  and  maximum  moment  of  inertia  ~  with  the  added  restriction  that 
the  axes  must  pass  through  the  center  of  mass  of  the  body.  The  problem  of  finding  a 
characteristic  orientation  for  an  ESI  may  then  be  viewed  as  that  of  discovering  the  axes 
about  which  a  system  of  point  masses  would  have  the  minimum  moment  of  inertia  and 
the  maximum  moment  of  inertia. 

Ihts  lask  is  known  in  Lagrangian  dynamics  as  that  of  determining  the  principal 
moments  of  inertia  [13]  The  techniques  as  applied  to  the  problem  discussed  above  are 
developed  completely  in  Appendix  I.  Using  those  results,  it  is  possible  to  assign  a 
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unique,  characteristic  orientation  to  any  ESI  representation  ^ 

Recognition  by  principal  axes  alignment 

I  he  first  step  in  reccgnmng  that  an  object  matches  a  known  prototype  is  to 
align  the  two  spheres  so  that  their  characteristic  axes  coincide.  The  next  step  involves 
determining  whether  a  true  match  of  the  ESI  points  exists  In  the  case  where  the  object 
representations  are  guaranteed  to  be  exactly  correct  (eg  in  an  artificial  or  synthesized 
example),  the  points  must  coincide  perfectly  and  haxe  identical  masses  (surface  area 
values).  Matching  becomes  trivial  In  more  realistic  situations,  however,  one  might 
exj>oct  that  a  match  should  still  occur  e\cn  if  some  of  the  points  are  slightly  amiss  in 
location  or  mass  This  would  be  especially  true  if  the  unknown  object’s  representation 
was  the  result  of  processing  real  world  data,  such  as  a  photograph  The  matching 
algorithm  must  then  incorporate  some  tolerance,  the  amount  of  which  could  perhaps  be 
dependent  on  the  known  uncertainty  associated  with  the  shape  determination  process. 

At  this  point,  only  a  couple  of  speculative  suggestions  can  be  mentioned  First, 
it  can  be  noteil  that  if  an  exact  match  exists,  then  the  alignment  of  principal  axes  will 
pair  F.SI  point*  without  any  discrepancy  --  provided  unique  principal  axes  exist  This 

1  In  general.  a  single  unique  orientation  may  be  deteranned.  but  for  objects  which  exhibit  t  high 
degree  of  Symmetry.  two  or  more  equivalent  orientation  frames  may  be  possible  (Consider,  for 
e  •  amp  i  e .  tn*  »«»i  about  which  a  cube  has  the  ma»>m^  moment  of  inertia  )  This  does  not  i  eem  to  post 
•ny  problem  to  the  matching  process,  because  each  of  thtie  possible  orientations  is  equivalently 
useful  t n  simply  choosing  one.  a  correct  match  will  still  result,  since  the  match  is  based  on 
shape  and  s i •  e  atone  a  cube  has  no  unique  "top*  side  Yet,  a  regular  tetrahedron  is  an  avcaption 
to  this  general  idea  It  allows  »iny  matches  which  are  equally  good  based  an  moments  of  inartla. 
but  not  all  of  which  iff  genuine 
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means  that  in  no  case  will  it  be  possible  to  slightly  rotate  one  sphere  to  produce  a 
su|>erior  match  Howeser,  in  a  situation  where  an  exact  match  is  not  obtained,  it  is  not 
Certain  that  alignment  of  principal  axes  will  produce  the  "best"  or  "closest"  match.  If  it 
is  found  that  a  slight  rotation  in  some  situations  would  result  in  a  better  match,  a 
technique  known  as  chamfer  matching  seems  ideally  suited  for  use  in  determining  the 
best  refinement  to  make  [1]  A  pictorial  explanation  of  how  chamfer  matching  works 
would  be  to  \iew  each  ESI  point  on  one  sphere  as  being  down  in  an  inverted  cone 
shaped  valley  The  |>omts  on  the  other  sphere  wish  to  roll  down  into  the  valley  as  if  by 
gravity,  but  the  system  of  points  is  rigidly  connected.  The  matching  finds  the  most 
agreeable  compromise 

In  either  case,  the  remaining  issue  is  that  of  determining  the  relative  goodne ss  of 
a  match  It  is  probable  that  one  would  wish  to  recognize  an  object  mostly  by  its  shape, 
rather  than  its  si/e  (Should  a  little  cube  "match"  a  big  cube1)  This  means  that  the 
difference  m  location  between  ESI  points  is  significant;  whereas,  it  is  the  ratio  of  masses 
for  each  pair  of  points  that  one  should  examine.  Potentially  appropriate  measures  would 
be  the  root-mean-squarc  of  the  differences  in  position  between  paired  points  and  the 
standard  deviation  of  the  mass  ratios  of  point  pairs 

I  his  manner  of  recognizing  objects  seems  quite  promising  for  objects  all  of 
whose  features  are  known  Unfortunately,  from  a  single  view  (as  in  a  photograph)  it  is 
usually  the  case  that  parts  of  the  object  arc  unobservable  and  must  be  guessed  at.  The 
ESI  representation  itself  aids  m  filling  in  missing  pieces,  but  any  attempt  to  identify  such 
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an  object  by  the  recognition  method  just  described  would  be  hopeless  in  all  but  a  few 
luck)  cases  At  present,  the  only  method  envisioned  for  recognizing  such  objects  would 
involve  preserving  the  hemisphere  of  valid  points  and  then  trying  to  apply  a  similar, 
though  appropriately  modified,  method  using  them  alone  -*  determining  principal  axes, 
etc  The  prototypes  against  which  the  match  was  being  done  must  also  be  "halved."  Of 
course,  one  need  only  consider  hemispheres  which  contain  the  desired  number  of  points. 
This  technique  results  in  many  more  matches  being  tried  (on  the  order  of  the  number 
of  points  in  the  prototype). 

In  the  preceding  discussion  of  this  method  of  recognition,  it  has  been  implicitly 
assumed  that  the  representation  of  the  object  is  at  least  not  too  grossly  in  error  and, 
specifically,  that  each  represented  object  face  corresponds  to  exactly  one  real  face  of  the 
object  being  modeled  If,  however,  the  shape  determination  process  reports  a  face  that 
docs  not  actually  exist  or  if  it  fails  to  detect  a  face  that  is  actually  present,  then  the 
resulting  ESI  will  have  more  or  fewer  points  than  it  should  In  such  a  situation,  it 
seems  that  the  proposed  recognition  algorithm  would  fail  miserably  due  to  its  heavy 
reliance  on  matching  point-for-point  Yet  hypothesizing  that  the  shape  determination 
algorithm  would  only  produce  such  a  mistaken  result  for  relatively  small  surface  areas  of 
the  object  and  assign  to  them  an  approximately  correct  orientation,  it  might  be  the  case 
that  the  characteristic  orientation  of  an  erroneous  model  would  differ  only  slightly  from 
that  of  the  true,  accurate  model  In  such  a  situation,  if  the  two  models  were  aligned  by 
the  orientations,  then  all  but  the  false  ESI  point(s)  would  be  closely  paired.  One  would 
expect  the  points  of  the  inaccurate  model  which  lie  closest  to  the  false  point  (or  location 
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of  the  "missing"  jx^mt >  to  deviate  more  from  their  correct  positions  than  other,  more 
distant,  points  Noting  such  an  occurrence  during  the  matching  process,  the  absence  or 
presence  of  .hi  extra  point  might  potentially  be  detected,  and  the  correct  match  could 
then  result  Incorporating  a  detection  scheme  of  this  sort  into  the  recognition  process 
would,  however,  eliminate  part  of  the  screening  procedure  proposed  for  determining 
those  representations  which  should  be  used  in  the  comparisons  Specifically,  if  one  is 
seeking  to  identify  an  1  Si  which  consists  of  N  points,  then  in  the  naive  manner  one 
would  only  examine  the  proton,*  models  which  consist  of  N  points.  In  dealing  with 
errors  that  effect  the  number  of  presumed  faces,  it  would  then  be  necessary  to 
additionally  check  me  models  consisting  of  N+l,  N-l,  N+2,  N-2,...,  N+i,  and  N-i 
points,  where  i  is  the  maximum  number  of  potentially  erroneous  faces  (perhaps  a 
fraction  of  N,  though  hopefully  a  small  number). 

Representing  smooth  or  non  convex  objects 

I  he  I  Si  representation  and  techniques  developed  thus  far  arc  best  suited  for 
application  to  convex  polyhedra  In  trying  to  extend  the  domain  to  non-convex  objects 
or  smooth-surfaced  obiects,  several  difficulties  arise  For  instance,  the  ESI  for 
non-convex  obiccts  is  not  unique,  i.e.  there  is  not  a  one-to-one  mapping  between  a 
non-convex  object  and  its  ESI  (See  page  37  in  Appendix  II.)  Then  too,  the  spherical 
image  of  smooth  objects  does  not  consist  solely  of  points,  as  does  that  of  polyhedra.  (6) 
In  fact,  an  object  which  has  no  planar  surfaces  would  have  a  spherical  image  that  would 
be  distributed  over  every  part  of  the  sphere  with  varying  "density"  or  "intensity" 
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providing  the  sh.i|>c  information  The  integral  of  this  density  over  a  patch  on  the 
Ci.uisM.in  sphere  equals  the  total  area  with  surface  normal  with  direction  falling  within 
the  patch 

I  hough  it  may  lie  feasible  to  represent  smooth,  convex  objects  by  their  true 
spherical  images,  no  techniques  for  doing  so  have  been  conceived  Instead,  the  approach 
has  been  to  employ  polyhedral  techniques.  This  necessitates  approximating 
smooth-surfaced  objects  by  poljhedra  and  resolving  the  difficulties  associated  with 
representing  non-convex  polvhedra  Expedient  methods  for  approximating  complex 
surfaces  In  planar  faces  have  been  developed.  [10]  They  generally  produce  a  somewhat 
optimal  friangtilufion  of  the  surface,  in  other  words,  the  approximating  polyhedron  for  a 
smooth  object  would  consist  of  triangular  shaped  faces  This  is  quite  acceptable  for  the 
intended  purposes  ' 

Onlv  an  inelegant  solution  has  been  envisioned  to  circumvent  the  ambiguities 
associated  wnh  non-convex  polvhedra  This  entails  using  existing  algorithms  for 
subdividing  the  objects  into  convex  components,  keeping  track  of  how  the  pieces  "fit 
back  together"  somehow-  (*>]  Since  each  piece  is  then  a  convex  polyhedron,  it  may  be 
represented  in  the  normal  fashion  as  an  ESI.  Reconstruction  would  involve  first 
reconstructing  each  component  part  and  then  joining  the  pieces  back  together  in  the 


1  th*  thppe  <ieter»i"pi  ,o"  prpcemng  "ay  already  output  »p*c  if  lc»t  loot  of  lh»  lurfoc*  normal  for 
m»r>f  mot)  region  of  p  iv'fid  Ihia  teu'O  t>>»»  be  r rga-ded  ot  a  many  aided  polyhedral 
•  open* 'not  ion  to  t*>*  aurfoc*.  thua  foregoing  the  t  r  >  pngut  ot  ion  proceaa  mentioned  abou*  On*  ahould 
"Ot*  th»t  't  wnu'd  "ot  b*  "ecriaary  to  determine  the  edgea  between  eech  anetl  region,  though  th« 
aurfpc*  pr**  itionatld  with  etch  would  be  needed  (In  actuellty.  thp  littl*  Iwrfpc*  petchea  n*y 
not  notch  up  erectly  Pt  thpir  *Og«t,  to  t*>ey  would  not  b(  defining  p  *rppt*  polyhedron  ) 


projier  in. inner  (<>)  However,  it  should  be  noted  that  this  process  would  yield  a 
polyhedron.  If  .t  smooth  object  is  desired,  a  smoothing  operation  would  be  inundated, 
though  .m  exact  reconstruction  of  the  original  object  would  probably  be  impossible. 
Little  extra  difficulty  would  result  for  performing  spatial  manipulations  on  such  an 
object,  all  of  the  component  parts  would  just  be  treated  as  an  indivisible  set  -•  operators 
being  applied  to  a  base  vector  for  the  complete  object. 

Recognition  of  non-convex  objects  might,  however,  present  more  of  a  difficulty. 
For  instance,  one  cannot  even  lie  confident  that  the  algorithm  for  subdividing  an  object 
into  convex  components  would  slice  two  identical  objects  in  the  same  way,  because  in 
general  there  is  no  unique  set  of  slicings  to  produce  convex  parts  for  any  given 
non-com  ex  object  If  a  division  algorithm  is  developed  that  w  ill  always  yield  the  same 
components  for  a  gnen  object,  the  matching  process  would  probably  become 
manageable  In  fact,  before  attempting  to  match  shapes  of  component  pieces,  the 
algorithm  could  first  try  to  recognize  an  object  by  the  number  of  pieces  and  their 
spatial  relations  to  each  other 

Conclusion 

This  paper  has  presented  the  current  state  of  development  for  an  object 
representation  using  enhanced  spherical  images.  The  representation  of  convex  polyhedra 
is  wrll  understoixl,  while  methods  for  determining  point-edge  representations  from  their 
ESI  forms  are  less  so  Spatial  manipulations  are  a  promising  advantage  of  the  ESI 
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model,  as  is  object  recognition.  The  methods  for  matching  two  ESl’s  have  been 
investigated,  though  some  specifics  need  more  attention.  Smooth-surfaced  and 
non-convex  objects  present  considerable  challenges  which  must  necessarily  be  overcome 
for  the  ESI  representation  to  be  practical  and  useful.  A  first  attempt  at  handling  these 
difficulties  involves  approximating  the  objects  by  one  or  more  polyhedra,  and  even  this 
idea  offers  additional  perplexities 

Significant  progress  has  been  made  towards  a  complete  system  of  support  for  the 
ESI  representation  of  objects,  but  much  work  lies  ahead  if  the  techniques  are  to  ever  be 
utilized  in  realistic  applications. 


Appendix  I 


Determination  of  Principal  Moments  of  Inertia 

The  principal  moments  of  inertia  for  a  system  of  point  masses  will  be 
determined  here  symbolically.  [20]  Throughout  this  discussion,  the  Cartesian  coordinates 
of  a  |K>int  will  be  expressed  as  the  triple  (xj,  yp  Zj),  and  the  mass  will  be  referred  to  as 

mj 

For  any  system  of  point  masses,  its  moments  of  inertia  about  the  coordinate  axes  —  X, 
Y,  and  7.  --  may  be  computed  as  follows: 

I*x  =  £  (>.2  *  zi2)  ,ni 

Iyy  =  1  U,2  ♦  X2)  HI, 

hi  "  *  <x.2  +  >'i2)  mi 

Likewise,  its  products  of  inertia  may  be  found: 

Vxy  “  Kx  “  £  xi'i,ni 
hi  =  hx  =  *  x.zimi 
lyz  hy  ~  E  >’i7imi 

In  general,  the  inertia  matrix  (inertia  tensor)  has  the  form: 
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There  is  usualh  ;i  unique  orientation  of  axes  X-Y-Z  at  a  given  origin  for  which  the 
products  of  inertia  vanish. 
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This  coordinate  frame  defines  the  principal  axes  of  inertia,  and  the  corresponding  lxx, 
Ivv,  and  lrl  are  called  the  principal  moments  of  inertia  They  represent  the  maximum, 
minimum,  and  an  intermediate  value  of  the  moments  of  inertia 

Fite  principal  moments  of  inertia  are  the  eigenvalues  of  the  inertia  matrix.  I  he 
solution  of  the  determinant  equation 


l.xx-1 

-'xy 

_IXZ 

*yx 

"V* 

*zx 

-I*y 

lu~ 

0 


for  I  yields  three  roots  I j.  Is,  Ij  of  the  resulting  cubic  equation.  (These  are  the 
eigenvalues.) 

The  direction  cosines  /,  m,  n  of  the  principal  inertia  axes  are  given  by 
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I'xx-W  ‘  Ixym  *  *xz"  *  0 
-Iyx/  ♦  (lyy-l)M  -  I  yZn  =  0 

-!„/  -  Izvm  ♦  (Izz-l)n  =  0 

These  equations  along  with  /*  +  +  tr  =  1  enable  a  solution  for  the  direction 

cosines  to  be  made  for  each  of  the  three  roots  (Ij,  I2*  1 3)  separately. 

From  the  inertia  matrix, 


**xx“*Xlyy-l)(Izz-l)  +  ^~*xyX“*yzX"*zx) 

+  (-JjiZH-IyX)(-lZy)  -  (“JZx)^yy“l)(-I],z) 

-  (-IzyX-IyxXlxx-I)  -  (Izz"I)("Iyx)(_Ixy)  =  0 

which  may  Iv  expanded  to 
-l’ 

+  **xx  *  *yy  +  *zz** 

+  <-*x\*yy  ”  *xx*zz  “  *yy*zz  +  *xz^  *  *yz^  +  *xy  X 
+  dSx*vy^zz  “  ^*xy*xz*yz  "  *yy*xz^  ”  *xx*yi^  ”  *zz*xy^  ~  ® 

One  met ho<l  for  solving  such  a  third  order  algebraic  equation  is  that  of  Graeffe's 
root-squaring  method  (51  although  direct  methods  for  solving  cubic  equations  are  also 
known* 


I  Intuitively.  onf  would  tipfct  Old  th»  throe  rootl  would  mmunlj  olooyi  bo  rodl  nioobero, 
liner  they  do  rtruurl  pi>/iuil  Mm'ili  of  morlio  ond  or*  the  roiwlt  of  computation  involving  only 
root .  poiMivt  quoMitivi  fhi»,  hewovor,  hot  not  bton  vtrlfiod  by  analytical  proof 


Once  the  three  principal  moments  of  inertia  are  determined,  they  may  be  used 
to  compute  the  direction  cosines  of  the  corresponding  axes,  thus  giving  the  characteristic 


directions  for  the  system  of  point  masses 


(,xx_,)/  -  lxsm  -  ,x/'1  *  0 


—  IXy  /  ♦  (Iyy-I)WI  -  Iy  ,/»  «  0 


-\J  -  \ylm  +  Uu-\)n  =  0 


Algebraically  solving  these  equations  produces 


/  =  <  1  ♦  Q2  ♦  R2)^ 


(i«-i . 

*xz  1  w_ 
_,vz  W 


L.-l  L. 

(Tf-  - 
‘xv  ‘vz 

n  ~  i  -i  r 

.  lzr  xz  , 

<TI - T~) 

•yz  xy 

where 


rv  -  4  * w*  ~  * xy_| x: 

“^xz^yy  +  *xz*  “  ^xy^yz 


LJ..  -  I.. I  ♦  L.I.. 
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As  .1  final  comment,  it  should  be  noted  that  the  computational  method 
presented  here  for  determining  principal  axes  is  well-suited  for  application  to  point  mass 
systems  which  do  not  exhibit  a  high  degree  of  symmetry.  Special  treatment  would  be 
warranted  for  cases  in  which  any  of  the  products  of  inertia  vanishes.  Conceptually,  such 
a  situation  might  l>e  remedied  simply  by  rotating  the  system  slightly  and  recomputing 
the  moments  and  products  of  inertia. 
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Introduction 


The  representation  of  three-dimensional  objects  in  the  realm  of  digital  computers  is  a 
fundamental  keystone  to  several  current  pursuits  In  the  field  of  Artificial  Intelligence,  such 
as  computer  vision,  spatial  reasoning,  and  manipulation.  Straightforward  descriptions  (e  g., 
representing  an  object  by  Its  vertices,  edges,  or  surface  patches)  suffer  from  the  large 
requirements  of  data  storage  and/or  of  computation  time  for  data  processing.  For  example, 
to  perform  a  single  rotation  of  a  point-represented  object  about  an  arbitrary  axis  using 
matrix  multiplication  calculates  384  ♦  16n  products,  where  n  Is  the  number  of  points  1 14). 
This  representation  turns  out  to  be  quite  expensive  when  methods  of  object  recognition 
depend  on  repeatedly  rotating  prototype  objects  during  some  matching  process  In  this 
paper,  an  alternative  representation  for  three-dimensional  objects  shall  be  discussed  which 
essentially  depicts  an  object  by  Its  surface  normals.  This  will  be  the  Enhanced  Spherical 
Image  representation 

Spherical  Images 

The  properties  of  surfaces  In  which  we  are  Interested  include  the  area,  the 
Inclination  with  respect  to  some  standard  direction,  and  the  curvature  The  latter  may  be 
characterized  by  two  numbers,  namely  the  principal  curvatures.  These  may  he  thought  of  as 
the  rate  of  change  of  the  surface  normal  directions  In  each  of  two  perpendicular  directions 
(In  the  tangent  plane).  Alternatively,  the  curvature  at  a  point  may  be  represented  by  a 
single  number  using  a  method  originated  by  Gauss.  This  number,  of  course,  will  be  a  function 
of  the  principal  curvatures  and  Is  in  fact  their  product  It  is  known  as  the  Gaussian 
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curvature  [6,  lb]. 

In  his  studies  of  differential  geometry,  Gauss  developed  a  process  for  mapping  a 
surface  onto  a  sphere  This  transformation  consists  of  translating  the  various  outward  unit 
normals  of  a  surface  to  the  center  of  a  unit  sphere.  The  points  on  the  sphere  which  are  at 
the  end  points  of  these  vectors  are  then  the  spherical  representation  of  the  surface,  lo,  a 
apherlca/  image  It  should  be  noted  that  this  mapping  assigns  a  definite  point  on  the  sphere 
to  every  point  of  the  surface 

Throughout  the  remainder  of  this  paper,  the  sphere  on  which  a  spherical  image  Is 
formed  shall  be  called  the  Gaussian  sphere  Shown  below  are  some  simple  geometric 
objects  and  sketches  of  their  associated  spherical  images 


Polygons  and  Polyhedra 

Except  for  the  concluding  section,  this  paper  shall  now  mainly  deal  with  the  simplified 
case  of  representing  polyhedra.  Therefore,  a  few  of  the  basic  properties  of  polygons  and 
polyhedra  will  be  presented  here  to  standardize  our  terminology!  I  1  ]. 

A  polygon  Is  a  two-dimensional  figure  whose  sides  are  straight  line  segments.  A 
polygon  Is  said  to  be  convex  if  it  entirely  contains  all  segments  connecting  any  two  of  its 
points.  Thus,  the  interior  angles  (those  facing  the  inside)  of  a  polygon  are  all  less  than  180 
degrees.  A  polygon  is  uniquely  determined  by  its  Interior  angles  and  the  lengths  of  its  sides, 
though  this  Is  not  the  case  if  only  its  angles  and  its  area  are  specified.  (Consider,  for 
example,  a  2x3  rectangle  versus  a  1  x6  rectangle.) 

A  polyhedron  is  a  three-dimonsional  figure  whose  faces  are  polygons.  Two  facte*  of  a 
polyhadron  are  adjacent  if  they  have  a  common  edge  A  polyhedron  Is  closed  if  Its  faces  are 
convex  polygons  and  all  of  its  edges  specify  a  pair  of  adjacent  faces,  l.e.,  there  are  no 
"holes"  In  the  polyhedron.  A  convex  polyhedron  is  one  which  entirely  contains  all  line 
segments  connecting  any  two  of  its  points  The  angle  between  any  two  adjacent  faces  of  a 
convex  polyhedron  is  less  than  180  degrees.  Thus,  no  two  faces  may  have  normals  in  the 
same  direction. 

Future  references  to  polygons  and  polyhedra  should  be  understood  to  mean  convex 
polygons  and  convex  and  closed  polyhedra,  unless  otherwise  stated 


In  1807,  H.  Minkowski  proved  the  following  theorem  which  reveals  how  polyhedra  may 


be  uniquely  specified 

Theorem  of  Minkowski. 

If  each  face  of  one  convex  polyhedron  corresponds  to  a  face  of 
another  convex  polyhedron  such  that  the  two  faces  are  equivalent  (of 
equal  area)  and  have  parallel  outer  normals,  and  conversely,  then  the  two 
polyhedra  are  equal  and  parallel 

Which  la  to  say  that  a  polyhedron  is  determined  uniquely  by  the  areas  and  directions  of  Its 
faces  This  concept  may  now  be  combined  with  that  of  spherical  Images  to  develop  an 
efficacious  representation  for  three-dimensional  objects. 


Enhanced  Spherical  Images 


The  mapping  described  abovo  for  generating  a  spherical  image  does  not  associate  a 
unique  point  on  the  Gaussian  sphere  with  each  point  of  an  object,  as  may  easily  be  verified 
by  considering  any  body  some  of  whose  surface  normals  are  parallel  --  one  with  a  planar 
surface,  for  example  In  the  case  of  polyhedra,  the  number  of  normals  which  map  into  the 
same  point  on  the  Gaussian  sphere  is  a  measure  of  the  surface  area  of  the  corresponding 
face.  This  area-related  data  may  be  preserved  by  forming  an  enhanced  spherical  image 
(ESI) 


Definition:  Enhanced  Spherical  Image 

An  Enhanced  SpharlcaJ  Imagu  Is  a  spherical  representation  In  the 
atandard  sense  (as  described  above)  which  has  a  numeric  quantifier, 
"Intensity",  associated  with  each  point  of  the  Image  Ihe  value  of  this 
Intensity  corresponds  to  the  total  surface  area  of  the  represented 
object  which  has  mapped  onto  this  point  of  the  Gaussian  sphere 


The  following  are  some  examples  of  convex  potyhedra  and  their  associated  ESI'a. 


Uniqueness  of  an  Enhanced  Spherical  Image 

The  utility  of  a  depiction  for  three-d-mensional  bodies  would  be  questionable  if 
objects  were  not  guaranteed  to  be  represented  uniquely  Proof  of  the  following  theorem  Is 
thus  essential  to  the  development  of  the  ESI  representation 

Theorem  of  Uniqueness. 

The  ESi's  of  convex  potyhedra  are  unique.  I.e.,  the  mapping 
between  a  convex  polyhedron  and  Its  ESI  Is  one-to-one. 


Proof 

Part  1:  A  convex  polyhedron  maps  Into  one  and  only  one  ESI. 

This  may  be  shown  by  observing  that  the  spherical  mapping  is  a  composition  of 
one-to-one  mappings  We  begin  by  recalling  that  the  outward  normals  of  a  surface  are 
necessarily  parallel  If  the  surface  Is  planar  (as  la  the  case  with  faces  of  a  polyhedron). 
Letting  each  of  these  normals  assume  unit  length,  we  may  translate  (a  one-to-one  function) 


each  of  them  to  the  center  of  a  unit  sphere.  Since  they  all  have  the  same  direction,  they 
map  Into  a  singln  point  on  the  sphere's  surface  (one-to-one)  The  surface  area  of  one  face 
of  a  polyhedron  Is  a  fixed  constant,  and  once  associated  with  the  point  In  the  spherical 
image,  a  point  in  the  ESI  is  formed  This  process  may  be  repeated  unambiguously  (since  no 
two  faces  have  normals  In  the  same  direction)  for  each  face  of  the  polyhedron,  producing  a 
unique  ESI. 

Part  2:  An  ESI  of  a  convex  polyhedron  maps  Into  one  and  only  one  convex  polyhedron. 

Since  no  two  faces  of  a  convex  polyhedron  can  have  parallel  normals,  a  spherical 
Image  consisting  of  n  distinct  points  corresponds  to  a  convex  polyhedron  of  n  faces  Thus, 
an  ESI  of  any  convex  polyhedron  consists  of  exactly  the  Information  which  specifies  the 
outward  normal  and  surface  area  for  each  face  of  the  polyhedron  By  Minkowski's  Theorem, 
we  conclude  that  an  ESI  of  this  type  represents  one  and  only  one  distinct  polyhedron 
Q.E.D 


Thus  It  is  shown  that  the  ESI  representation  may  at  least  be  useful  In  representing 
convex  pdyhedra  The  reader  should  observe,  however,  that  certain  anomalies  arise  if  the 
objects  are  not  convex.  In  fact,  for  any  ESI  which  corresponds  to  a  real  convex 
polyhedron,  there  is  an  infinite  set  of  polyhedron-like  objects  with  holes  in  their  faces  which 
would  map  Into  the  Identical  ESI.  One  can  also  conceive  of  infinite  sets  of  LSI's  which  do 
not  correspond  to  any  real  convex  polyhedra  —  simply  permute  the  intensity  or  position  of 
one  of  the  points  of  an  ESI  which  does  represent  a  real  polyhedron  1 


1.  if  m*  rMd»  fkMa  *»•  »*fioun  lo  eor>cap»uall/a,  tha  Cantar  o’  Matt  Thaoram  Infroducaa  m  1f«a  na»t  taction  wlH  pro  via*,  a 
mora  formal  juati’lcattoo  for  lt*«  ttatamant 
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Another  Interesting  class  of  objects  Is  that  which  produces  open  ESI'a.  An  open  ESI 
Is  one  in  which  all  points  of  the  Image  lie  on  a  single  great  circle  of  the  Gaussian  sphere.  A 
closed  ESI  is  one  which  is  not  open,  for  example,  figure  (A)  below  could  be  thought  to 
represent  an  "open-ended"  box.  while  figure  (B)  could  represent  an  "open-ended" 
triangular  tube 


For  exa  >ples  of  this  type,  the  "length”  of  the  corresponding  object  Is  Inversely  proportional 
to  its  cross-sectional  area,  though  neither  measure  Is  fixed  by  the  ESI.'  We  proceed  now 
by  developing  some  mechanics  towards  the  utilization  of  this  enhanced  representation. 

Center  of  Mass  Property 

Having  observed  that  not  all  ESI's.  which  consist  solely  of  points,  actually  represent 
real  polyhedra.  one  can  appreciate  the  need  for  a  simple  test  to  determine  the  validity  of  an 
ESI.  As  it  shall  bn  seen,  the  test  presented  here  is  not  only  useful  In  verifying  a  correctly 
formed  ESI,  but  It  also  proves  essential  to  the  reconstruction  of  an  object  from  Its  enhanced 
mapping  onto  the  Gaussian  sphere. 

1  It  i*  ,Mora.t*e  "o'*  **'  an  opon  EM  common*  lo  an  obfocl  wboaa  EiAor  cbacactoftattc  la  not  *9*1  to  2.  M  l»  !*• 
COM  *o»  an  ciotod  otjOcta.  Alto,  a  doaad  ESI  common*  to  a  boonrtod  coova*  body.  l  a.,  ana  v»Mcn  may  ba  anctoaaS  by  a 
mbara  of  fmito  cadbrt 
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Definition:  Center  of  Mass  of  an  ESI. 

The  center  of  mass.  CM,  of  an  FSI  is  analogous  to  the  center  of 
mass  for  a  system  of  point  masses.  The  points  of  the  system  are  the  FSI 
points,  the  positions  aro  their  coordinates  on  the  Gaussian  sphere,  and 
their  masses  are  the  intensities  of  the  spherical  image  points  Thus, 
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where  (X|,  yj.  Z|)  are  the  Cartesian  coordinates  of  the  i**1  ESI  point  and 
Is  Its  Intensity. 


m. 


With  this  definition,  we  may  now  state  the  following  valuable  theorem. 


Center  of  Mass  Theorem. 

An  ESI  corresponds  to  a  real  convex  object  if  and  only  if  Its 
center  of  mass  Is  at  the  origin  of  the  Gaussian  sphere  and  the  ESI  is 
closed. 


Proof 


Dividing  the  Gaussian  sphere  along  one  of  Its  great  circles,  we  obtain  two 
hemispheres  Let  ty  be  the  outward  normal  of  the  sphere  which  Is  perpendicular  to  the  plane 
of  the  great  circle  Then  the  component  of  the  total  area  of  the  convex  object  which  faces 
In  the  direction  of  Is  the  sum  of 

m(  •  cos(ej) 

over  the  ESI  points  on  the  hemisphere,  where  m(  Is  the  intensity  of  the  point  and  e(  Is  the 
angle  between  and  the  normal  to  the  sphere  at  the  i*h  point  If  the  component  of  the  total 
area  which  faces  the  -Jj  direction  (represented  on  the  opposite  hemisphere)  is  not  equal  to 
this  sum,  then  the  represented  object  cannot  be  closed,  and  thus  cannot  be  convex.  If  all 
pairs  of  hemispheres  do  represent  equal  areas,  then  all  of  the  point  masses  are  balanced, 
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and  the  center  of  mass  must  be  at  the  center  of  the  Gaussian  sphere.  Q.E.D 

Now  that  the  ESI  has  been  shown  to  represent  a  convex  polyhedron  uniquely  .md 
that  an  ESI  may  be  tested  for  correspondence  to  a  real  polyhedron,  we  may  now  tackle  the 
problem  of  reconstructing  an  object  from  its  FSI  representation. 

Adjacency,  Reduction,  and  Augmentation 

Unfortunately,  the  Information  provided  by  the  ESI  of  a  polyhedron  does  not 
immediately  disclose  which  of  the  represented  faces  are  adjacent  to  each  other. '  This,  of 
course,  would  have  greatly  simplified  the  task  of  reconstruction.  We  have  not,  however, 
lost  all  Information  about  the  adjacency  of  the  faces,  as  shall  be  proven  later.  Before  we 
Invest  the  effort  of  doing  this,  let  us  examine  what  we  stand  to  gain  The  following 
definition  points  us  in  the  right  direction. 

Definition  Adjacent  points  of  ESI's. 

Two  points  on  a  Gaussian  sphere  are  adjacent  If  one  of  them  haa 
no  closer  neighbors  (In  terms  of  the  arc  length  between  them)  than  the 
other. 

Now  recalling  that  the  minimum  number  of  faces  for  a  real  polyhedron  Is  four  (a  tetrahedron) 
and  therefore  that  the  ESI  of  a  real  polyhedron  must  have  at  least  four  points,  we  are  lead 
to  the  following  corollary  to  the  Center  of  Mass  Theorem 


1.  Th*  r»*d*r  may  com/inca  Nmiai*  of  thu  fact  by  con* Waring  tha  ESr*  corraapondbig  to  a  oub*  with  on*  of  It*  oomar* 
•I load  off  Depending  on  the  n/a  of  th*  removed  portion,  th#  now  faca  will  leva  S,  4,  or  0  edge*  i  t  long  a*  th*  dbactlon  of 
tho  aafaoa  r nrmal  l»  kepi  constant  (i  a  .  all  of  th*  sllca*  ar*  m  partllal  plana*),  than  th*  position*  of  tha  point*  In  tha  ESI  will 
aof  chang*.  tfmj^i  th*  adjacent  faca*  may 
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Corollary  of  Reduction. 

For  any  FSI  which  consists  of  n  points  (n  >  4)  and  which 
corresponds  to  a  real  convex  polyhedron  [n-hedron],  any  pair  of 
adjacent  pointa  may  be  replaced  by  a  single  point  such  that  the  resulting 
ESI  also  corresponds  to  a  rual  convex  polyhedron  [(n-1  )-hedron].  We 
shall  say  that  the  n-hedron  Is  thus  reduced  to  an  (n-1  )-hedron. 

Proof 

The  corollary  follows  immediately  from  the  Center  of  Maas  Theorem  if  it  is  always 
poeeibie  to  replace  two  adjacent  points  by  a  single  point  such  that  the  position  of  the 
center  of  mesa  for  the  system  of  points  is  preserved.  We  now  show  that  such  a  point  may 
always  be  found. 

Two  points  on  the  surface  of  a  sphere  of  radius  R  specify  a  great  circle  of  the 
sphere.  To  preserve  the  center  of  mass,  the  new  point  must  lie  along  the  arc  connecting 
the  two  adjacent  points.  The  problem  is  thus  transformed  into  one  of  two  dimensions  Polar 
coordinates  are  used  here: 
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To  complement  the  Corollary  of  Reduction,  one  may  also  give  an  Inverse  construction 
with  an  inverse  proof.  This  would  be  the  Corollary  of  Augmentation,  and  we  shall  say  that  an 
(n-l)-hedron  Is  thus  augmented  to  an  n-hedron.  It  may  be  noticed  that  this  construction  is 
analogous  to  the  augmentation  of  nets  (see  p.  66  of  1 1 1]) 

Theorem  of  Adjacency 

Having  the  above  corollaries,  one  might  now  discern  the  Intended  plan  of  attack  -- 
hopefully  ending  at  the  ultimate  goal  of  a  reconstruction  algorithm.  The  relationship  between 
adjacent  points  of  an  ESI  and  the  adjacent  faces  of  the  corresponding  polyhedron  is 
revealed  by  the  following  theorem  and  shall  play  a  vital  role  in  achieving  our  goal. 

Theorem  of  Adjacency. 

Adjacent  points  In  an  ESI  are  the  mappings  of  adjacent  faces  in 
the  corresponding  convex  polyhedron,  l.e.,  the  faces  have  a  common 
edge. 

Proof. 


Assume  that  a  pair  of  adjacent  points  corresponds  to  a  pair  of  faces,  I  and  j,  of  a 
polyhedron  which  are  not  adjacent.  Then  there  exists  at  least  one  face,  k,  between  these 
two  faces,  and  the  direction  of  k's  normal  vector  must  differ  from  that  of  I  more  than  the 
direction  of  J’a  normal  differs  from  that  of  I.  However,  this  Is  possible  only  If  one  of  the 
following  is  the  case 


1.  There  is  only  one  face  between  the  faces  I  and  J.  and  It  has  at 
least  one  edge  thet  Is  not  the  edge  of  any  other  face,  or 


2.  There  are  at  least  two  faces  between  the  faces  I  and  J,  and  at 
least  one  of  the  inward  facing  angles  between  two  adjacent 
facea  Is  greater  than  180  degrees. 

In  either  case,  the  polyhedron  is  not  convex.  Thus,  the  theorem  is  proven  by  contradiction. 


Theorems  of  Adjacent  Reduction  and  Adjacent  Augmentation 


Combining  the  Theorem  of  Adjacency  with  the  Corollaries  of  Reduction  and 
Augmentation  produces  the  final  two  tools  needed  for  specifying  how  to  reconstruct  a 
polyhedron  from  Its  ESI. 


Theorem  of  Adjacent  Reduction. 

When  an  ESI  of  n  points  is  reduced  to  n-1  points,  say  by  replacing 
adjacent  points  I  and  J  with  point  k,  the  set  of  faces  of  the  (n-l)-hedron 
which  are  adjacent  to  face  k  are  the  union  of  those  that  were  adjacent 
to  the  faces  I  and  J  in  the  n-hedron  (excluding  I  and  J). 


The  proof  follows  from  the  definition  of  convex  polyhedra  and  the  Corollary  of  Reduction. 


Theorem  of  Adjacent  Augmentation. 

When  the  faces  adjacent  to  a  given  face  k  of  an  (n-l)-hedron  are 
known,  the  face  k  may  be  augmented  into  2  faces  I  and  J,  thus  forming  an 
n-hedron.  The  faces  adjacent  to  face  I  are  those  whose  edges  of 
Intersection  with  k  now  fall  on  i's  side  of  the  edge  between  I  and  J  (plus 
face  J),  and  similarly  for  J. 


The  proof  follows  from  the  definition  of  convex  polyhedra  and  the  Corollary  of  Augmentation 
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Reconstructing  a  Tetrahedron 

We  digress  momentarily  to  examine  the  "limiting''  case,  the  tetrahedron,  which  is  the 
polyhedron  with  the  fewest  number  of  faces 


Consider  ihe  following  property  of  triangles  (which  compose  the  faces  of  tetrahedra): 
a  triangle  is  uniquely  specified  by  the  length  of  its  three  sides.  It  may  also  be  shown  that  a 
triangle  is  uniquely  determined  by  its  area  and  Its  three  interior  angles  by  deriving  the 
lengths  of  the  sides  from  this  information. 
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Since  the  ESI  of  a  tetrahedron  provides  exactly  the  Information  needed  to  denote  the 
Interior  angles  and  area  of  each  face,  and  since  the  faces  are  all  adjacent  to  each  other,  a 
tetrahedron  may  be  reconstructed  from  Its  ESI  as  follows: 


Tetrahedron  Reconstruction  Algorithm 


1 .  Choose  one  point  of  the  ESI  (a  face  of  the  tetrahedron). 
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2.  Determine  the  angles  of  intersection  of  the  planes  represented 
by  the  other  three  points.  These  give  the  interior  angles  of  the 
triangle  which  is  the  chosen  face. 

3.  Calculate  the  lengths  of  the  sides  of  the  chosen  triangle. 

4.  Construct  the  triangle  on  the  plane  specified  by  the  chosen 
point. 

6.  Position  the  other  three  faces  such  .‘hat  their  edges  of 
Intersection  with  the  chosen  face  coincide  with  the  sides  of  the 
triangle. 


Since  the  positions  of  all  four  faces  are  then  fixed,  the  reconstruction  is  complete  This 
special  case  for  tetrahedron  will  serve  as  the  basis  in  the  restoration  of  more  general 
potyhedra. 


Poiyhedra  Reconstruction 


At  last,  we  have  reached  our  goal  Any  polyhedron  may  be  reconstructed  from  Its  FSI 
as  follows: 


Poiyhedra  Reconstruction  Algorithm. 

1.  Recursively  reduce  the  ESI,  "remembering"  each  pair  of 
adjacent  points  which  get  replaced,  until  it  consists  of  only  four 
points. 

2.  Reconstruct  the  tetrahedron  which  corresponds  to  this  ESI. 

3.  Successively  augment  the  faces  In  the  reverse  order  of  the 
reductions  using  the  remembered  points  to  choose  the 
augmentation. 


This  algorithm  may  be  shown  to  work  correctly  by  induction  on  the  number  of  points  In  the 
ESI.  The  aeries  of  figures  below  give  a  pictorial  example  of  this  reconstruction  algorithm  In 
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use 


Smooth  Convex  Objects  and  Non -convex  Objects 

For  this  technique  of  representing  three-dimensional  objects  to  be  of  practical  use,  it 
must  be  able  to  model  more  general  objects  than  simply  convex  polyhedra.  Since  the 
theorems  and  algorithms  developed  so  far  are  all  Inherently  dependent  on  the  properties  of 
polyhedra,  the  Initial  temptation  Is  therefore  to  denote  smooth  convex  objects  as 
many-faced  polyhedra.  This  approach  suggests  examining  the  limiting  situation  as  the  area 
of  each  little  face  then  tends  to  zero.  With  any  luck  at  all,  doing  this  study  will  reveal  a 
more  general  spherical  Image  representation  to  be  utilized  In  the  cases  of  non-zero 
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curvature. 

Non-convex  objects  pose  a  potentially  more  frustrating  problem,  since  even 
r*on-convex  polyhedra  caiinot  be  represented  unambiguously  by  FSI's.  For  example,  the 
following  two  objects  have  identical  mappings  onto  the  Gaussian  sphere 


One  might  consider  treating  non-convox  objects  as  two  or  more  convex  objects  with 
common  planar  faces  This  Involves  using  several  Gaussian  spheres  to  represent  a  single 
object,  and  some  mechanism  for  combining  reconstructed  objects  would  then  be  required 
Interestingly  enough,  once  we  have  crossed  the  thre«nold  of  focusing  attention  on  more 
than  a  single  FSI  at  any  one  time,  the  possibility  of  representing  scenes  of  objects  by  a  set 
of  "linked"  Gaussian  spheres  then  presents  Itself  At  this  time,  none  of  those  extended 
schemes  have  been  Investigated,  but  they  should  certainly  be  considered  in  future 
research  Involving  enhanced  spherical  images. 

Conclusion 

An  original  motive  for  developing  a  representation  employing  surface  normals  was  to 
facilitate  computational  manipulation  of  a  tin  nr- dimensional  object's  description  Using  the 
spherical  model  developed  here,  rotations  ai>out  an  arbitrary  axis  are  analogous  to  spinning 
a  sphere  —  all  points  on  its  surface  rr  at  the  same  time.  This  Is,  of  course,  a  result  of 
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using  spherical  coordinates,  where  rotation  Is  effected  by  adding  appropriate  offsets  to 
each  point's  altitude  and  azimuth  angles.  A  more  significant  advantage  of  using  FSI's  would 
become  apparent  when  object  recognition  is  attempted.  Though  an  algorithm  has  not  yet 
been  developed,  the  matching  process  should  be  greatly  simplified.  Tasks  Involving  spatial 
reasoning  (as  a  mechanical  manipulator  might  require),  scene  analysis,  and  image 
reconstruction  would  all  bonefit  from  having  an  object's  surface  normal*  readily  available. 

It  has  been  demonstrated  In  this  paper  that  the  Fnhanced  Spherical  Image 
representation  is  feasible.  Suggestions  have  been  offered  for  future  research  and  possible 
applications  It  is  hoped  that  these  further  investigations  will  support  the  practicality  of 


this  representation  and  that  Implementation  will  prove  useful 
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